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Given a complex network, its L-paths correspond to sequences of L + 1 distinct nodes connected 
through L distinct edges. The i-conditional expansion of a complex network can be obtained by 
connecting all its pairs of nodes which are linked through at least one L-path, and the respective 
conditional i-expansion of the original network is defined as the intersection between the original 
network and its L-expansion. Such expansions are verified to act as filters enhancing the network 
connectivity, consequently contributing to the identification of communities in small-world models. 
It is shown in this paper for L — 2 and 3, in both analytical and experimental fashion, that an 
evolving complex network with fixed number of nodes undergoes successive phase transitions - the 
so-called //-percolations, giving rise to Eulerian giant clusters. It is also shown that the critical 
values of such percolations are a function of the network size, and that the networks percolates for 
L — 3 before L = 2. 

PACS numbers: 89.75.Fb, 02.10.Ox, 89. 75. Da, 87.80.Tq 



I. INTRODUCTION 



One of the most remarkable properties of complex net- 
works is their tendency to undergo a topological phase 
transition (i.e. percolation) as the number of connec- 
tions is progressively increased 0, 0, • Several systems, 
such as the world wide web and epidemics dissemination, 
only become particularly interesting near or after perco- 
lation. Although some works have investigated phase 
transitions in dynamical systems underlain by complex 
networks (e.g. 0,0), here we focus our attention on topo- 
logical critical phenomena in such networks. In addition 
to the classical studies by Erdos and Renyi, more re- 
cent works addressing percolation in networks include the 
analysis of the stability of shortest paths in complex net- 
works [|| , the investigation of percolation in autocatalytic 
networks , and the study of the fractal characterization 
of complex networks || . 

The concept of L-percolations is based on the L- 
expansions and L-conditional expansions of a complex 
network, which are introduced in the current paper. Con- 
sider the two-subsequent edges in Figure Qfa). The fact 
that node 2 is indirectly connected to node 3 through 
a self-avoiding 2-path passing through node 1 defines a 
virtual link between nodes 2 and 3 [jj, represented by 
the dotted line in that figure. In case such a virtual 
edge does belong to the network, it defines a cycle of 
length 3 between the three involved nodes, as illustrated 
in Figure H^b), which contains three distinct undirected 
2-paths. The 'self-avoiding' requirement for the paths 
is imposed in order to avoid passing twice through the 
same edge, which would tend to produce an infinite num- 
ber of paths between any pair of connected edges. For 
instance, in case this restriction is not considered, there 
would be an infinite number of even-length paths (i.e. 
L = 3,5,7,...) from node 1 to 2 in Figure da), while 
just the direct /-path between those nodes remain after 



imposing the self- avoiding requirement. Such a restric- 
tion therefore allows a clearer characterization of the dis- 
tribution of path lengths and connectivity in a complex 
network. 

The specific demands governing the network growth 
may imply that cycles such as that in Figure IHb) oc- 
cur sooner or later along the network evolution. For in- 
stance, intense indirect (i.e. through node 1) information 
exchange between nodes 2 and 3 is likely to foster the ap- 
pearance of the direct link between those two nodes [?|. 
In other words, the formation of such cycles can be un- 
derstood as a reinforcement of the connectivity between 
the involved nodes, possibly implied by intensive infor- 
mation interchange and/or node affinity. Therefore, the 
density of 5-cycles (as well as cycles similarly defined for 
other values of L) is likely to provide interesting insights 
about the growth dynamics and connectivity properties 
of complex networks. This is one of the main motivations 
for the investigations reported in the present article, with 
implications to the identification of communities in the 
analyzed networks, as described below. Another impor- 
tant aspect intrinsic to the definition of L-paths is that 
such a property is directly related to the transitivity of 
connections along the network. In other words, if node 
i\ is connected to node which is connected to node 
and so on, until i l , the eventual presence of the therefore 
established virtual link extending from i\ to i^ can be un- 
derstood as an indication of transitivity in the network 
connectivity. 

It is worth observing that a 3-cycle can also be re- 
lated to a hyperedge between the three involved nodes, 
as illustrated in Figure HTc). Such a kind of edge is char- 
acteristic of hypergraphs 1 1 1 L Il2j |. whose nodes are 
connected through hyperedges. In other words, a hyper- 
edge is a relationship defined at the same time between 
more than two nodes. The essential difference between 
the 3-cycle shown in Figure ^b) and the hyperedge in 
(c) is the fact that the 3-cycle can be dismantled by re- 
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(a) 

FIG. 1: A pair of subsequent edges defining a virtual link (~^~\ fT) 

between nodes 2 and 3 (a), the basic cycle of size 3 underlying C-^vfTX 
^-percolations (b) and its relationship with the concept of r~<^~" / "\)~\ 

hyperedge (c) . v-*tT / ~^y-^ 



moving any of the involved edges while the hyperedge 
implies simultaneous deletion of all connections (i.e. a 
hyperedge is a single edge and can only be removed as a 
whole). In other words, a 5-hyperedge intrinsically im- 
plies a 5-cycle, but not the other way round. Still, situa- 
tions where the (L+l) -cycles are closely interdependent 
can be understood in terms of hyperedges. 

Given a complex network T with N nodes, its L- 
expansion is henceforth defined as the new network, also 
with N nodes, which is obtained by incorporating an 
edge between two nodes i and j whenever there exists 
a virtual link of length L (i.e. a self-avoiding path of 
length L) between those two nodes. Observe that such a 
definition holds for both directed and undirected graphs. 
Such expansions can be implemented by considering or 
not multiple links between two nodes, leading to different 
results. The L-expansion of a network can be intersected 
with the original network Y in order to obtain the L- 
conditional expansion of that network. Such a network, 
which also involves N nodes, contains all cycles of length 
L+l in the original network and no node with null or 
unit degree. Actually, all connections in such a network 
belong to cycles of length L + l. In case L = 2, the re- 
spective L-conditional expansion is composed exclusively 
of 5-cycles, and can therefore be related to a respective 
5-hypergraph contained in the original network. Figure[21 
illustrates a simple undirected graph (a) and its respec- 
tive 2- (b) and 5-expansions (c). The 2- and 5-conditional 
expansions of the graph in Figure |2Ia) are shown in (d) 
and (e). 

Figure 13 shows a simple network (a) and its respective 
^-conditional expansion, which involves only 5-cycles. 
Observe that the conditional percolation removes all the 
edges not involved in 5-cycles, so that the obtained clus- 
ters represent a strong backbone of the original net- 
work. Observe that any single edge can be removed from 
the conditionally-expanded network without producing 
a new cluster, which is true for any L > 2. At the 
same time, the conditional expansion tends to enhance 
the clustering coefficient [jj of the obtained network. For 
the specific case L = 2, the conditional expansion also 
tends to enhance the regularity |2(il | of the network, in 
the sense that all nodes in the resulting network have 
degree equal to an integer multiple of 2. 
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FIG. 2: A simple graph (a) and its respective 2- (b) and 
3-expansions (c). The respective 2- and 3-conditional expan- 
sions are shown in (d) and (e). The completeness ratios are 
f 2 = 0.43 and / 3 = 0.86. 



Conditional expansions can be thought of as a kind of 
network filter which removes edges in order to preserve 
those groups of nodes more densely connected through 
L-paths, therefore characterizing a tendency of the con- 
ditional expansion to preserve the subgraphs contained 
in small- world models Q, ^| . This fact makes the con- 
ditional expansions an interesting mechanism for identi- 
fying communities in such a kind of complex networks, a 
possibility which is preliminarly investigated in the cur- 
rent work (see Section ITTT)l . Observe that the hubs, i.e. 
nodes with high degree are not necessarily preserved by 
conditional expansions. For instance, although the hub 
marked as a in Figure|3|was retained, the one marked as 
b was eliminated by the 5-conditional expansion. 

The progressive addition of new edges into a random 
network inevitably leads to the appearance of a giant 
cluster, which dominates the network henceforth. Such a 
phenomenon, corresponding to a topological phase tran- 
sition (i.e. percolation) |14| of the network, has motivated 
much interest and bridged the gap between graph theory 
and statistical mechanics. As shown in this article both 
in analytical and experimental fashion, the uniform evo- 
lution of a random network (and also of a preferential- 
attachment model) naturally leads not only to the tradi- 
tional percolation, but also to successive L-percolations, 
which are characterized by the fact that a giant clus- 
ter appears in the respective L-conditional expansions. 
For instance, as the number of edges is progressively in- 
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(b) 

FIG. 3: A simple network (a) and its ^-conditional expansion 
represented in terms of cycles/hyperedges. 

creased, one reaches a point where a giant cluster is es- 
tablished in the original network where each of its nodes 
is connected to at least another node not only through a 
direct connection, but also through a self-avoiding path 
of length L. This is the main property characterizing the 
giant clusters for generic L-percolations. For instance, 
in case L = 2, the giant cluster is characterized by the 
fact that each of its edges is part of a 5-cycle. In other 
words, every edge of the giant cluster can be associated 
to a ■S-hyperedge, thus emphasizing the fact that the con- 
nections of such a ,2-giant cluster are stronger than the 
traditional case (i.e. L = 1). The hypergraph in Fig- 
ure [3] provides a simplified illustration of the connections 
characterizing the giant cluster in a ^-percolated com- 
plex network. It should be noticed that the giant cluster 
obtained in an L-conditional expansion has the unique 
characteristic that it can not be modified by a subsequent 
conditional expansion for the same value L, a property 
called idempotency. 

The concept of L-percolations allows a series of insights 
about the analyzed networks, including: 

Theoretical features: As one of the main interests in 
complex networks is related to the occurrence of critical 
topological transitions, the identification of further per- 
colations of a growing network represents an important 
fact by itself, indicating that the abrupt changes of the 
network properties, namely connectivity, is not restricted 



to 1-connectivity, but extends over several values of L. 
Therefore, the dynamics of network connectivity forma- 
tion is verified to be richer than usually considered in 
complex network theory. 

Identification of the strongest connections: The occur- 
rence of an L-percolation indicates a reinforcement be- 
tween most of the connections in a network as far as L- 
paths are concerned. In other words, the fact that a giant 
cluster is obtained for a specific value of L indicates that 
most of the network nodes are connected through closed 
L-paths (i.e. most belong to cycles), therefore exhibiting 
enhanced connectivity for that value, in the sense that 
several links may be removed before the percolated clus- 
ter collapses. 

Network characterization: Several measurements re- 
lated to the concept of L-connectivity can be defined 
and used to characterize the connectivity properties of 
complex networks (such as transitivity), as illustrated in 
the last Section of this article. Taken in combination 
with traditional features such as the average node degree 
0- Q ■ such measurements allow a more complete char- 
acterization of the connectivity features of the network 
under analysis. 

Community finding: The clusters obtained by the L- 
conditional expansions are characterized by enhanced av- 
erage clustering coefficient while presenting a tendency to 
remove links between loosely connected groups of nodes, 
suggesting the division of the original network into mean- 
ingful communities (e.g. 0,0])- In other words, the 
L-conditional expansions of a network may help the iden- 
tification of the involved communities, as only the more 
intensely connected nodes (i.e. those characterized by L- 
paths) will remain after the conditional expansion. This 
possibility is preliminarly illustrated in Section lTTTl of this 
article. 

Eulerian networks: In case the conditional expansions 
are performed allowing for multiple links between two 
nodes, it is shown in this paper that the giant clusters 
underlying the L-percolations, and therefore contained 
in the orig inal network, are necessarily Eulerian M ^ 
other words, all nodes of the giant cluster defined by an 
L-percolation can be visited without passing twice over 
the same edge. Such a property has interesting impli- 
cations for several types of complex networks, including 
communication networks and protein sequence networks 
(e.g. H3)i as wen as f° r random walk investigations [l8| . 



II. DEFINITIONS 

Let the nodes of the network of interest T be repre- 
sented as k = 1, . . . , N, and the edges as ordered pairs 
(i,j). The total number of edges in the network is ex- 
pressed as n(T). For generality's sake, the network T 
can be fully represented by its weight matrix W, where 
no self-connections are allowed. The weight matrix is 
obtained by assigning the weight of each edge to 
the respective weight matrix element w(j,i). Such a 
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representation of the network in terms of a weight ma- 
trix is more general than the traditional adjacency ma- 
trix, as it allows the representation of the weights asso- 
ciated to the network edges. To any extent, the adja- 
cency matrix A of a graph can be obtained by making 
its elements a(J,i) = 1 whenever there is an edge (of 
any non-zero weight) between nodes i and j. The av- 
erage node degree of T is henceforth represented as z. 
Let St(x) be the operator acting elementwise over the 
matrix x in such a way that the value one is assigned 
whenever the respective element of x has absolute value 
larger or equal than the specified threshold T; for in- 
stance, 5 2 (x = (3,-2,0,-4,1)) = (1,1,0,1,0). Thus, 
the adjacency matrix can be obtained from the weight 
matrix as A = 5t(W). 

The random and preferential-attachment models were 
obtained as described in the following. For the random 
network case, the growing parameter corresponds to the 
Poisson rate (the mean density of connections) p, and in 
the preferential attachment case, to the number c of ex- 
isting connections in the current stage of growth, which 
are normalized in terms of the respective average node 
degree as zr = p(N — 1) and z$f = 2c /N respectively, 
for the sake of direct comparison between the two net- 
work types. The preferential-attachment network growth 
was performed by keeping a list where each network node 
with current degree k appears k times, so that subse- 
quent edges can be defined by selecting pairs of distinct 
elements in such a list according to the uniform distribu- 
tion [Tgj . 

One can check for an L-path between two nodes i and 
j by checking the sequences of successors for each tran- 
sition along the graph, until either a path is found or all 
the paths have been tested (the so-called depth-first scan- 
ning of the graph from node i). Only successor nodes not 
already included in the constructing path are considered 
during this procedure. Observe that, in the particular 
case of L — 2, the respective paths can be immediately 
obtained from the squared adjacency matrix ignoring its 
main diagonal. 

The L-expansions and L-conditional expansions are 
henceforth represented as Ti and Hi,, respectively. In- 
teresting information about the intrinsic topology of the 
original network T can be obtained from their mea- 
surements of the respective expansions. Some simple 
possibilities are the number of edges in each of these 
graphs, henceforth expressed as n(Y l) and n(S^). Thus, 
Jl = n(Si)/n(r), hence the completeness ratio of the 
network T for L, corresponds to the ratio between the 
number of edges belonging to (L + 1)— cycles and the 
total number of edges in the original network T. For in- 
stance, in case all edges in T are part of a 5-cycle, we 
have /2 = 1, while smaller values are obtained for less 
transitive networks. 

An interesting property of the giant clusters defined by 
the L-percolations when multiple links are allowed is the 
fact that they are necessarily Eulerian. More specifically, 
such multiple edges are implemented as follows: given the 
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FIG. 4: A simple network obtained after ^-conditional ex- 
pansion (a) and the duplication of its shared edges in order 
to warrant the Eulerian property. 



^-conditional expansion of the original network, duplicate 
each edge which is shared by two cycles, as illustrated in 
Figure This interesting property can be easily proved 
by considering that all nodes in such a network will have 
even degree, which is a necessary and sufficient condition 
for the cluster to be Eulerian Hal. 



III. CONDITIONAL EXPANSIONS AND 
COMMUNITY FINDING 

This section reports a preliminarly investigatation of 
the potential of L-conditional expansions as a subsidy 
for community finding. We already observed in Sec- 
tion [i] of this article that such expansions tend to pre- 
serve the more strongly connected subnetworks or com- 
munities. Figure EJa) shows an initial network contain- 
ing 3 communities of 20 nodes, each corresponding to a 
random network whose edges were added according to 
uniform probability with Poisson rate 0.3. By adding 
random edges between these 3 groups, small- world net- 
works such as that shown in Figure[S^b) can be obtained. 
The potential of the L-conditional expansions for isolat- 
ing the original communities is illustrated in Figurc^c), 
which shows the network obtained by performing the 2- 
conditional expansion over the network in Figure [SJb). 
Although missing some of the original connections, the 
three clusters obtained by the conditional expansion cor- 
respond to the original communities. The connections 
inside each obtained community can be partially com- 
plemented by incorporating those edges in Figure \5fb) 
which connect nodes inside each of the identified clusters. 
This type of link is henceforth called an intracommunity 
edge, while links between nodes in different communities 
are called intercommunity edges. The effect of this pro- 



5 



cedure is illustrated in Figure with respect to the three 
communities identified in Figure Efc). Although a total 
of 19 edges were recovered, the disconnected node at the 
upper left corner of the network remained isolated from 
its original community. 

We have verified through experimental simulations 
that, for a relatively small number of added edges (Pois- 
son rate about 0.015 for networks such as that in Fig- 
ure |5j) , the original communities can be reasonably es- 
timated by applying the ^-conditional expansion. Sit- 
uations where some intercommunity edges are left by 
such expansions can be addressed by using the Newman 
and Girvan's method after the respective ^-conditional 
expansion. However, situations involving a substantial 
number of random intercommunity edges, as when the in- 
tercommunity and intracommunity edges are established 
with similar probabilities, arc likely to produce incorrect 
communities. 



IV. X-PERCOLATIONS 

The L-percolations of random networks have been in- 
vestigated from both the analytical and experimental 
points of view, as described in the following. Preferential- 
attachment networks were also considered, but only ex- 
perimentally. 



A. Analytic Mean-Field Calculations 

The following mean-field approximation assumes that 
the edge assignment takes place independently of the 
node degree or other network features, being more ac- 
curate for not extremely dense networks (see below) . Let 
F be a random network with N nodes and Poisson rate 
p (defining the overall edge density), implying the maxi- 
mum number of edges to be tit — N(N — l)/2. It imme- 
diately follows that the average number of expected edges 
is n = priT and that the average node degree therefore 
is z — 2n/N — p(N — 1). We adopt a mean field ap- 
proach considering each node i. As the expected number 
of nodes connected to i is z, we have that the maxi- 
mum number of 2-paths involving the edges (k, i); (i,p) 
established between any two nodes k and p connected 
to node i is z(z — l)/2, so that the expected number of 
direct connections between them is pz(z — l)/2 and the 
expected total number of 2-paths is na = Npz(z — l)/2, 
implying the respective ^-conditional expansion of the 
original graph to have Poisson rate P2 as given by Equa- 
tion ^ Now, since the ^-conditional expansion of Y is 
also a random network (recall that the edge assignment 
takes place independently of node degree or other current 
network property), we obtain the critical point p\ where 
the respective percolation takes place by considering that 
the node degree of the ^-conditional expansion, namely 
z%(2 — ce), reaches unit value at that critical transition 
(see Equation |2J) • Interestingly, this critical point is ver- 




(c) 

FIG. 5: Illustration of how the conditional expansion of a 
network can act as a filter enhancing the communities in 
small-world types of networks: (a) original network contain- 
ing three communities; (b) network obtained by addition of 
random connections between communities; and (c) partial 
identification of the three original communities obtained by 
^-conditional expansion of the network in (b). 
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FIG. 6: Complemented communities obtained by incorporat- 
ing edges from the network in Figure QJb) into the three re- 
spective clusters in Figure QJc). 



ified to grow as a power of N. The respective average 
degree of the original network at this critical point can 
be estimated as in Equation [21 



Next we address the case L = 3 by considering each 
edge e connecting two nodes i and j of average degree 
z. The total of direct connections between the distinct 
nodes k and p connected respectively to i and j, defining 
3-paths (k,i); (j,p) between nodes k and p, there- 
fore is given as z 2 , and the expected number of such 
directed links per edge therefore is (pz) 2 . The total 
number of direct links between the pairs of nodes con- 
nected through 3-paths passing through edge e therefore 
is tt.3 = riT(pz) 2 = Np i (N — l) 3 /2, implying respective 
Poisson rate as in Equation Observe that, in case 
the connections become too dense, the number of closed 
paths such as (k, i); (J, k), instead of the assumed 
open paths (k,i); (j,p), may become too frequent 
and undermine the above estimations for L = 3. As 
the 5-conditional expansion of the original random net- 
work is also assumed to be a random network, the as- 
sociated critical rate value can now be calculated as in 
Equation [S] The corresponding average degree of the 
original network at this critical point is given by Equa- 
tion |SJ Interestingly, it follows that p§ < p 2 for any 
N > 1, i.e. the 3-conditional expansion is expected to 
percolate sooner than the ^-conditional expansion. We 
also have that f 2 = p 2 (N - 1), f 3 = p 3 (N - l) 2 , and 
h/h = P3/P2 = z. 
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B. Simulation Results 

The experimental analysis was performed by monitor- 
ing the normalized maximum cluster size M(z) for each 
instance of the growing networks for a total of 300 realiza- 
tions of each configuration. Figure0shows the maximum 
cluster sizes obtained for random and preferential attach- 
ment networks for N = b 2 , where b = 3,4, . . . , 15. The 
preferential attachment cases where generally character- 
ized by smoother transitions than their random counter- 
parts. As expected, the 2- and 5-conditional expansions 
percolated later than the original network (L = 1), with 
the 3-conditional expansions of the original network per- 
colating sooner and more abruptly than the respective 
^-conditional expansions. Larger dispersions were ob- 
served for the maximum cluster sizes of the random net- 
works, indicating that their specific realizations are less 
uniform. As theoretically predicted, the critical average 
node degree z c resulted a function of the network size 
N in both the random and preferential-attachment mod- 
els, but such a dependency was markedly less intense for 
the preferential-attachment model. The dilog diagram 
in Figure |H1 shows the theoretical predictions (solid and 
dashed lines) and the values corresponding to 80% of the 
critical average node degrees. More specifically, the ver- 
tical axis represents the logarithm of the critical average 
node degree corresponding to 80% of the value of z for 
which the maximum dispersion of M(z) (the vertical bars 
in FigureEJ) is observed in the respective simulation. Ob- 
serve that a total of 13 simulation sets (executed for 200 
realizations considering each specific values of N, as iden- 
tified in the x-axis of Figure [SJ, and not only the three 
cases illustrated in the first row of Figure were consid- 
ered in order to obtain the results shown in Figure |HI A 
good agreement between analytic and experimental re- 
sults is verified for both L — 2 and 3. 

The completeness ratios for the random and 
preferential-attachment have also been estimated in our 
simulations, and the results for N = 25, 100 and 225 
are shown in Figure E3 It is clear from this figure that, 
as expected, higher completeness ratios were always ob- 
tained for the 3-conditional expansions than for the 2- 
conditional expansions, substantiating the fact that 3- 
conditional expansions tend to produce networks with 
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more edges than those obtained for ^-conditional expan- 
sions. At the same time, the preferential-attachment 
models were characterized by completeness ratios which 
grow faster with z than for the random model, indicat- 
ing that this type of complex network tends to produce 
a higher number of edges belonging to 3- and ^-cycles. 
Such a result is compatible with the fact that random 
networks tend to percolate faster than the preferential- 
attachment model because the completeness ratio only 
takes into account the number of existing edges in the 
original network. For instance, a network containing 10 
isolated 3-cycles will have /2 = 1 while a network with 
a single cycle containing 30 nodes will imply fa = 0. 
Therefore, while the size of the dominant cluster tend 
to grow slower in preferential attachment networks, the 
clusters in such cases have a higher number of edges be- 
longing to 3- or ^-cycles than those in random networks, 
as expressed in Figure OH 



V. APPLICATION TO REAL DATA 

In order to illustrate the potential of the concepts and 
measurements suggested above, they have been applied 
to experimental data regarding concept associations ob- 
tained in the psychophysical experiment described in , 
where a human subject is requested to associate words. 
After 1930 word associations were obtained, a weighted 
directed graph (3 was determined by representing each of 
the 250 words as nodes and the number of specific asso- 
ciations between two words as the weight of the edge be- 
tween the respective nodes. The respective average node 
seemed to follow a power law. Here we consider the ad- 
jacency matrix A — St=i(B + B'), where B = St=i(W), 
B' is the transpose of B and W is the weight matrix 
obtained in the concept association experiment. In other 
words, the original weight matrix is thresholded at T = 1 
and made symmetric in order to transform the original 
digraph into a graph. Matrix B was characterized by 
n = 738 edges, p = 0.011 and z = 5.9. The 2- and 3- 
conditional expansions of A were obtained as described 
above, leading to n-i — 407 and — 606 connections, 
P2 = 0.0067 and pz — 0.0092, with respective average 
node degrees of 4.5 and 5.7. Thus, we have fi = 0.55 
and /3 = 0.82 indicating, as expected, a higher density 
of 4~ than 5-cycles in the original network. Such high 
densities also imply that about half of the edges in the 
original network belong to 5-cycles, while over 80% of 
the edges belong to ^-cycles. As a matter of fact, the 
fact that these values exceed the respective critical val- 
ues z c for N = 250 (see Figure |SJ) strongly suggest that 
the word association network has already undergone per- 
colation as well as 2- and 5-percolations. Such results 
indicate that the word association network is underlain 
by 3- and ^-cycles formed as a consequence of transitive 
nature of word associations. 

An interesting question implied by the above results 
regards the quantification of the degree of transitivity in 
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water, drink, cold, soda, ice 


2 


round, hole, circle, square, table 


3 


animal, pig, cat, tail 


4 


Mary, John, man, woman 


5 


much, good, better, work 



TABLE I: Some of the communities identified by 3- 
conditional expansions applied to the word association ex- 
periment. Observe that such communities are characterized 
by the tendency of the respective words to form ^-cycles such 
as water i— > drink t— > soda i— > cold i— > water. 

a complex network in terms of the completeness ratios 
Jl . Although it is hard to establish a critical limit where 
networks can be said to be transitive or not, the degree of 
transitivity implied by the connections in a given complex 
network can be at least partially quantified in terms of 
the completeness ratios Jl ■ This follows directly from the 
own definition of this measurement as the ratio between 
the number of edges in the L-conditionally expanded net- 
work, which correspond to the number of edges belonging 
to transitive cycles of length L + 1, and the number of 
edge in the original network. The completeness ratios 
fi and /3 obtained for the word association experiment 
clearly indicate a high degree of transitivity for L = 2 
and 3, with over 50% of the edges obtained by 2- and 
5-conditional expansions belonging to respective cycles. 

Because of the high level of connectivity underlying 
the word association data, attempts to isolate commu- 
nities by using 2- and 5-conditional expansions led to 
a single dominant community and several rather small 
clusters. However, by thresholding the original weight 
matrix at T = 2 instead of T — 1, i.e. by making 
B = <5t=2(WO, a number of interesting communities were 
obtained by applying the 3-conditional expansion over 
the matrix A — 5t=i(B + B'). Table[I]shows some of the 
so obtained communities, which are characterized by the 
fact that the respective constituent words tend to form 
^-cycles such as water t— > drink i— > soda t— > cold i— > wa- 
ter, therefore leading to longer range correlations. The 
^-conditional expansion of A led to just two communities 
of three words each. 



VI. CONCLUDING REMARKS 

By introducing and characterizing the concepts of L- 
conditional expansions, and by showing that successive 
-L-percolations can be associated to a complex network, 
the current paper has opened a series of possibilities for 
further investigations, including not only the considera- 
tion of higher values of L and other evolution models, 
but also the use of the introduced concepts as a means 
to identify communities in the original network. As il- 
lustrated for the word association experiment discussed 
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FIG. 7: The giant cluster sizes for the random and preferential attachment networks and respective conditional expansions in 
terms of z for N = 25, 100 and 225 and L = 1 (o), L = 2 (+) and L = 3 (x). 




ln(N) 
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FIG. 8: Dilog presentation of analytical predictions (solid line: 
L — 4; dashed line: L = 3) and experimental values (x: 
L — 2; *: L — 3) of critical average node degrees in terms of 
N. 



in this work, the introduced methodology also presents 
good potential for characterizing real phenomena. 
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